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Summary Objective: Human aminopeptidase N (APN/CD13/ANPEP) has been iden¬ 
tified as the receptor for human coronavirus (HCoV) 229E. In this study, we analyzed 
the region of the APN gene that encodes a stretch of amino acid residues, essential 
for its HCoV-229E receptor function (amino acids 260-353). 

Methods: Full-length APN exon 3, intron 3 and exon 4, was PCR-amplified and se¬ 
quenced in DNA samples from 100 unrelated Caucasian Belgian healthy volunteers. 

Results: We identified seven polymorphisms, including four intron 3 and three exon 
4 variations. Apart from the already known C956T exon 4 mutation, the six other 
polymorphisms have not yet been described. The most prevalent APN variations in 
this population (C956T leading to an alanine to valine substitution, G978T, G987A and 
intron3-C389T) always occurred together at an allele frequency of 8.5%. Haploid DNA 
sequencing demonstrated the presence of these four variations on the same allele. 
Three polymorphisms in intron 3, intron3-G395C, intron3-C86T, and intron3-C429T, 
were identified with an allele frequency of 3.5%, 1% and 0.5% respectively. Five hap- 
lotypes were identified in the population of 100 individuals. 

Conclusion: These results demonstrate that there is a relatively broad spectrum of 
variations in the APN domain critical for coronavirus binding. 

The nucleotide sequence reported here has been submitted to the GenBank database 
with the following accession number: AF527789. 

© 2004 International Society for Infectious Diseases. Published by Elsevier Ltd. All 
rights reserved. 
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Introduction 

Coronavirusesare large, enveloped, single-stranded, 
positive-sense RNA viruses, with a genome size of 
approximately 30 kb. Human coronaviruses (HCoV), 
with two known serogroups designated OC43 and 
229E, are an important cause of upper respiratory 
tract illnesses, and are also implicated in diseases 
involving the digestive and the central nervous 
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system. 1 After the recent SARS outbreak and the 
discovery of a new coronavirus (SARS-CoV) as the 
causative agent, the importance of coronaviruses 
in human medicine can no longer be neglected. 2 

As a first step in viral entry into the cell, hu¬ 
man coronavirus 229E (HCoV-229E) attaches, in 
a species-specific manner, to aminopeptidase 
N (APN/CD13/ANPEP). 3 In other species APN is 
also used as a receptor by the HCoV-229E re¬ 
lated group I coronaviruses (porcine transmissible 
gastroenteritis virus, porcine epidemic diarrhea 
virus, feline infectious peritonitis virus and ca¬ 
nine coronavirus). 4,5 Although human APN had also 
been suggested as a putative receptor for the spike 
glycoprotein of the SARS-associated coronavirus 
(SARS-CoV), it has now been proven that not APN 
but angiotensin-converting enzyme 2 is a functional 
receptor for SARS-CoV. 6,7 The receptor for human 
coronavirus serotype OC43 (HCoV-OC43) is still un¬ 
known, although evidence has been presented that 
9-O-acetylated sialic acid is used as a receptor 
determinant for infection of human cells. 8 

APN is a type II glycoprotein that belongs to the 
family of membrane-bound metalloproteases. The 
150kDa APN glycoprotein (EC 3.4.11.2) is com¬ 
posed of 967 amino acids and is expressed in a 
variety of tissues including cells of the monocytic 
and granulocytic lineage, synaptic membranes of 
the central nervous system, and intestinal, lung 
and kidney epithelial cells. 9 A soluble form of APN 
is reported to be present in human plasma. 10 The 
physiological role of APN includes the conversion 
of oligopeptides in the small intestinal lumen into 
amino acids, and an involvement in the degrada¬ 
tion of regulatory peptides in other tissues such as 
the kidney and the brain. 9 The human APN gene has 
been mapped to chromosome 15q25-26, and the 
coding part of the gene is divided in 20 exons. 11,12 
Expression of APN is regulated by two different 
promoters, separated by an 8kb intron, leading to 
the formation of two transcripts that only differ in 
their 5' non-coding region. The epithelial promoter 
is located close to the coding part of the gene 
and is active in enterocytes and other epithelial 
cells, while the more upstream myeloid promoter 
is mainly active in myeloid cells. 13 Two splice vari¬ 
ants in which exon 3 and exon 14 were lost, have 
recently been identified, resulting in both cases in 
a carboxy-terminal truncated protein. 14 

Several applications of APN in therapy and diag¬ 
nosis have been described. APN/CD13 is frequently 
used in the characterization and typing of leukemia 
or lymphoma cells, as APN/CD13 is expressed on 
stem cells and during most developmental stages of 
myeloid cells. 15 It has recently been demonstrated 
that APN/CD13 is both a receptor for tumor homing 


peptides and a functional regulator of angiogene¬ 
sis, implicating a role for APN/CD13 as a target for 
anti-angiogenic therapy and tumor targeted drug 
delivery. 16,17 APN has also been shown to mediate 
human cytomegalovirus (HCMV) infection. 18 The in¬ 
hibition or neutralization of HCMV infection by sol¬ 
uble APN implicates the possibility of using soluble 
APN as an antiviral agent. 

The domain of human aminopeptidase N that is 
essential for its HCoV-229E receptor activity has 
been assigned to APN amino acids 260—353, with 
a critical stretch of eight residues (amino acids 
288—295). 19,20 This region is encoded by the ma¬ 
jor part of APN exon 3, whole exon 4 and a small 
part of APN exon 5 (nt 802—1083, hAPN mRNA, 
GenBank accession number XI3276). In this study 
we examined whether polymorphisms could be de¬ 
tected in the HCoV-229E binding domain of APN in a 
Caucasian population of 100 unrelated, healthy in¬ 
dividuals, assuming that these mutations could be 
of importance in HCoV-229E attachment to human 
cells. 


Materials and methods 
Subjects 

The study involved a Caucasian population of 100 
healthy, unrelated individuals from the region of 
Flanders, the northern part of Belgium. Informed 
consent was obtained from all participants and the 
study had the approval, based upon the guidelines 
from the World Medical Association’s Declaration of 
Helsinki, of the Ethics Board of the University of 
Leuven. 21 

DNA extraction 

DNA samples were collected through a non-invasive 
’swish-and-spit’ technique. Genomic DNA was ac¬ 
quired from oral epithelial cells, by rinsing the oral 
cavity with a 0.9% saline solution, after which DNA 
was extracted using an alkaline lysis procedure. 22 

PCR amplification 

An 871 bp fragment, encompassing exon 3, intron 
3 and exon 4 of the aminopeptidase N gene, was 
amplified by polymerase chain reaction (PCR), 
using S'-TGCTTCCCAAAGGTGAGTGG-S' as the for¬ 
ward primer and S'-CCATTGGCAGGATGAACTCC-S' 
as the reverse primer (GenBank accession number 
AC018988). PCR amplification was performed in a 
volume of 50 (jlL reaction mix, with a concentration 
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of 0.2 |xM of forward and reverse primer, 0.2 mM of 
nucleotides, 1.5 mM of MgCl 2 and 1 unit of Taq poly¬ 
merase (Applied Biosystems/Roche Molecular Sys¬ 
tems, Belgium) at pH 9. PCR conditions were com¬ 
posed as follows: an initial denaturation at 95 °C for 
five minutes, then 40 cycles of 30 seconds at 95 °C, 
30 seconds at 57 °C and 35 seconds at 72 °C, and a 
final elongation at 72 °C for seven minutes. The am¬ 
plification reaction was performed in a Geneamp® 
PCR System 9700 thermal cycler (Applied Biosys¬ 
tems, Foster City, CA, USA). The resulting PCR 
products were visualized after electrophoresis, on 
an ethidium bromide-stained polyacrylamide gel. 

Sequence analysis 

After purification of the PCR products with the 
QIAquick PCR purification kit (Qiagen, Westburg, 
The Netherlands), the purified products were 
cycle sequenced in forward and reverse direc¬ 
tion using the ABI PRISM BigDye Terminator Cycle 
Sequencing Ready Reaction kit (Applied Biosys¬ 
tems, Foster City, CA, USA). Besides the PCR 
primer set, two additional primers, a forward: 
5'-CTGCCCCAGGATCAAACAGG-3' and a reverse: 
5 , -GCACAGGGATGAAGAGAACG-3\ both located in 
APN intron 3, were used to obtain the full-length 
sequence of the 871 bp fragment (GenBank ac¬ 
cession number AJ421875). Electrophoretic sep¬ 
aration and detection were performed on an ABI 
PRISM 3100 Genetic Analyzer (Perkin Elmer, Applied 
Biosystems, Foster City, CA, USA). 

Haplotype analysis 

PCR products from individuals heterozygous for 
more than one variation were cloned into the 
pDrive cloning vector (Qiagen PCR Cloning kit) using 
One Shot Max efficiency DH5a-T1 competent cells 
(Invitrogen, Merelbeke, Belgium). Plasmids were 
purified with the Qiaprep miniprep spin kit (Qia¬ 
gen, Westburg, The Netherlands) and checked for 
inserts by EcoRI restriction analysis. Bidirectional 
cycle sequencing of plasmids with inserts was per¬ 
formed with the PCR primer set using the ABI PRISM 
BigDye Terminator Cycle Sequencing Ready Reac¬ 
tion kit (Applied Biosystems, Foster City, CA, USA). 
Electrophoretic separation and detection were 
performed on an ABI PRISM 3100 Genetic Analyzer 
(Perkin Elmer, Applied Biosystems, Foster City, CA, 
USA). 

Results 

A total of 100 healthy unrelated Belgian individu¬ 
als were screened for polymorphisms in the human 


aminopeptidase N domain that is essential for its 
HCoV-229E receptor activity. An 871 bp fragment, 
encompassing APN exon 3, intron 3 and exon 4 was 
PCR-amplified and sequenced, and was submitted 
to GenBank under accession number AF527789. The 
resulting chromatograms were analyzed using the 
SeqMan multiple sequence alignment tool (Laser- 
Gene, DNAStar, Madison, Wl). Consensus sequences 
were compared with a reference APN sequence in 
GenBank (accession number AJ421875) using BLAST 
(Basic Local Alignment Search Tool). 23 

Seven variations were identified, of which six 
have not yet been described. Table 1 shows the 
distribution of the allele frequencies. Intron vari¬ 
ations are described referring to their relative 
position in the intron, and exon variation positions 
are referred to their mRNA nucleotide position 
(GenBank accession number XI3276). The location 
of the polymorphisms in the APN gene is shown in 
Figure 1 . While no variations could be found in APN 
exon 3, four of the identified polymorphisms were 
located in intron 3 and three in exon 4. In intron 3, 
a C to T variation was found at intron 3 position 86 
(C86T) (GenBank AJ421875, contig position 327) in 
two individuals, as well as a G to C variation at in¬ 
tron position 395 (G395C) in seven individuals, and 
a C to T variation at intron position 429 (C429T) 
in one sample. All individuals were heterozygous 


Table 1 Allele 
mutations. 

frequencies of 

the 

APN gene 

Position 

Variation 

Frequency 

n % 

Intron 3 a 

86 

C 

198 

99.0 


T 

2 

1.0 

389 

C 

183 

91.5 


T 

17 

8.5 

395 

G 

193 

96.5 


C 

7 

3.5 

429 

C 

199 

99.5 


T 

1 

0.5 

Exon 4 b 

956 (codon 311) 

C 

183 

91.5 


T 

17 

8.5 

978 (codon 318) 

G 

183 

91.5 


T 

17 

8.5 

987 (codon 321) 

G 

183 

91.5 


A 

17 

8.5 


a Variations in intron 3 are referred to their position 
in intron 3. 

b Variations in exon 4 are referred to their mRNA po¬ 
sition (GenBank accession number XI3276). 
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Figure 1 Schematic representation of the human aminopeptidase N (APN) gene. M represents the myeloid promoter, 
and E the epithelial promoter. The primers used in the PCR reaction are indicated by F (forward) and R (reverse). 
The locations of the identified variations are indicated (arrows). 


for these polymorphisms, which have no appar¬ 
ent functional consequence, as they are located 
in a non-coding intron region of the APN gene. 
The fourth observed intron polymorphism, a C to T 
variation at intron position 389 (C389T) was found 
in association with three variations in exon 4: an 
already described C to T polymorphism in codon 
311 (GenBank XI3276, mRNA, nt position 956), 24 
leading to an alanine to valine substitution, and 
two silent mutations, a G to T variation in codon 
318 (nt 978), and a G to A variation in codon 321 (nt 
987). Fifteen heterozygotes and one homozygote 
for these four polymorphisms were found in the 
group of 100 Belgian individuals. In one of these 
fifteen heterozygous individuals, the intron varia¬ 
tion C429T was found. No deviations from expec¬ 
tations under Hardy-Weinberg assumptions were 
observed. 

Haplotypes were determined by sequence anal¬ 
ysis of cloned PCR-amplicons in samples of indi¬ 
viduals heterozygous for more than one variation. 
Five haplotypes were identified, and their frequen¬ 
cies are shown in Table 2. The four linked varia¬ 
tions, identified in samples of 16 individuals, were 
demonstrated to be present on the same allele, 
and this haplotype was detected with a frequency 
of 8.5%. 


Discussion 

In this study we analyzed an 871 bp region of the 
APN gene, encompassing the full-length APN exon 
3, intron 3 and exon 4. In our search for polymor¬ 
phisms in the APN domain that is essential for its 
HCoV-229E receptor function, we identified seven 
polymorphisms, of which four were located in the 
non-coding intron 3. In 25% of the Belgian individ¬ 
uals one or more of these seven variations were 
found. Haplotype analysis revealed the presence of 
five haplotypes in the population. Three polymor¬ 
phisms in APN exon 4 (C956T, G978T and G987A) 
in association with an intron 3 variation (C389T), 
were identified at a relatively high allele frequency 
(8.5%) in our Belgian population. Fifteen heterozy¬ 
gotes and one homozygote for these four variations 
were observed. One of the exon 4 single nucleotide 
polymorphisms is a C to T mutation, C956T, lead¬ 
ing to an amino acid change in codon 311. This ala¬ 
nine to valine substitution, a conservative amino 
acid change, has already been described in a pop¬ 
ulation of Italian coeliac disease patients, in which 
it was observed with an allele frequency of 7.9%. 24 
In our population, this polymorphism was approx¬ 
imately equally frequent (8.5%). The C956T poly¬ 
morphism was always found in combination with the 


Table 2 Distribution of the APN haplotypes. 


Intron 3 a 



Exon 4 b 



Frequency 


86 

389 

395 

429 

956 

978 

987 

n 

% 

C 

C 

G 

C 

C 

G 

G 

173 

86.5 

C 

T 

G 

C 

T 

T 

A 

17 

8.5 

C 

C 

C 

C 

C 

G 

G 

7 

3.5 

T 

C 

G 

C 

C 

G 

G 

2 

1.0 

C 

c 

G 

T 

c 

G 

G 

1 

0.5 


a Variations in intron 3 are referred to their position in intron 3. 

b Variations in exon 4 are referred to their mRNA position (GenBank accession number XI3276). 
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two other observed exon 4 variations (G978T and 
G987A) and one intron 3 variation (C389T). Hap¬ 
loid DNA sequencing demonstrated the presence of 
these four variations on the same allele. Here we 
report for the first time an association between the 
C956T polymorphism and other variations in the APN 
gene. Absolute linkage between the SNPs within 
this short physical distance is commonly observed. 
Apart from the codon 311 variation (C956T), none 
of the other associated polymorphisms has an ap¬ 
parent functional effect, as the codon 318 and 321 
variations (G978T and G987A) are silent, and C389T 
is located in an intron. 

Recent studies revealed that the human APN gene 
is subject to alternative splicing, and two splice 
variants, in which exon 3 and exon 14 were lost, 
have been identified. 14 Alternative splicing is a 
mechanism that allows different protein isoforms to 
be created from a single gene. The splicing process 
is regulated by both cis- and trans-acting factors, 
which control the choice of 5' or 3' splice sites either 
positively or negatively. Cis-acting regulatory ele¬ 
ments include exonic or intronic splice enhancers 
or silencers, which can be changed by mutations 
with a possible effect on mRNA splicing. 25,26 In this 
perspective, the importance of silent mutations or 
intron variations, which seem to have no apparent 
functional effect, should be reconsidered. Support¬ 
ing this assumption, a silent mutation (C77G) in 
exon 4 of the human protein-tyrosine phosphatase 
CD45 gene has been shown to increase the ex¬ 
pression of an aberrant exon 4-included CD45 by 
disrupting the activity of an exonic splice silencer. 
An association of this silent mutation with multiple 
sclerosis has been suggested. 27 In this way, it might 
be possible that the silent mutations in APN exon 4 
and the APN intron 3 variations that we observed in 
our population could be of functional importance. 

The alanine to valine substitution in codon 311, 
caused by the single nucleotide polymorphism 
C956T, is an amino acid change occurring in the 
APN domain that is essential for its HCoV-229E re¬ 
ceptor function. Although this amino acid variation 
is rather conservative, an effect on the HCoV-229E 
binding capacity of aminopeptidase N could be 
possible. Since the three-dimensional structure of 
APN has not yet been described, molecular mod¬ 
eling studies of the whole protein could not be 
performed. 

Further research is needed to elucidate the bi¬ 
ological relevance of the APN variations described 
here. In vitro studies should be carried out includ¬ 
ing in vitro transcription, translation and viral bind¬ 
ing assays of the described APN gene variants to 
detect a possible functional effect. Furthermore, a 
virus susceptibility test can be performed involving 


healthy volunteers homozygous or heterozygous for 
these variations. HCoV-229E is a relatively innocu¬ 
ous agent and it would be possible to determine 
whether these polymorphisms could affect suscep¬ 
tibility to infection. 

Given the substantial role of aminopeptidase N 
in several domains, including disease therapy and 
diagnosis, studies analyzing the APN gene and the 
functional consequences of possible gene variations 
will be of importance in these and future applica¬ 
tions. 
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